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Abstract 

We consider the problem of simultaneous user-scheduling, power-allocation, and rate-selection in an 
OFDMA downlink, with the goal of maximizing expected sum-utility under a sum-power constraint. In 
doing so, we consider a family of generic goodput-based utilities that facilitate, e.g., throughput-based 
pricing, quality-of-service enforcement, and/or the treatment of practical modulation-and-coding schemes 
(MCS). Since perfect knowledge of channel state information (CSI) may be difficult to maintain at the 
base-station, especially when the number of users and/or subchannels is large, we consider scheduling 
and resource allocation under imperfect CSI, where the channel state is described by a generic probability 
distribution. First, we consider the "continuous" case where multiple users and/or code rates can time- 
share a single OFDMA subchannel and time slot. This yields a non-convex optimization problem that we 
convert into a convex optimization problem and solve exactly using a dual optimization approach. Second, 
we consider the "discrete" case where only a single user and code rate is allowed per OFDMA subchannel 
per time slot. For the mixed-integer optimization problem that arises, we discuss the connections it has 
with the continuous case and show that it can solved exactly in some situations. For the other situations, 
we present a bound on the optimality gap. For both cases, we provide algorithmic implementations 
of the obtained solution. Finally, we study, numerically, the performance of the proposed algorithms 
under various degrees of CSI uncertainty, utilities, and OFDMA system configurations. In addition, we 
demonstrate advantages relative to existing state-of-the-art algorithms. 
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I. Introduction 

In the downlink of a wireless orthogonal frequency division multiple access (OFDMA) system, the 
base station (BS) delivers data to a pool of users whose channels vary in both time and frequency. Since 
bandwidth and power resources are limited, the BS would like to allocate them most effectively, e.g., 
by pairing users with strong subchannels and distributing power in order to maximize some function of 
the delivered data rates [1]. Although, for resource allocation, one would ideally like to have access to 
instantaneous channel state information (CSI), such CSI is difficult to obtain in practice, and so resource 
allocation must be accomplished under imperfect CSI. Thus, in this paper, we consider simultaneous user- 
scheduling, power-allocation, and rate-selection in an OFDMA downlink, given only a generic distribution 
for the subchannel signal-to-noise ratios (SNRs), with the goal of maximizing expected sum-utility under a 
sum-power constraint. In doing so, we consider relatively generic goodput-based utilities, facilitating, e.g., 
throughput-based pricing (e.g., [2]-[4]), quality-of-service enforcement, and/or the treatment of practical 
modulation-and-coding schemes (MCS). 

In particular, we consider the above scheduling and resource allocation (SRA) problem under two 
scenarios. In the first scenario, we allow multiple users (and/or MCSs) to time-share any given sub- 
channel and time-slot. In practice, this scenario occurs, e.g., in OFDMA systems where several users 
are multiplexed within a time-slot, such as IEEE 802.16/WiMAX [5] and 3GPP LTE [6]. Although the 
resulting optimization problem is non-convex, we show that it can be converted into a convex problem and 
solved exactly using a dual optimization approach. Based on a detailed analysis of the optimal solution, 
we propose a novel bisection-based algorithm that is faster than state-of-the-art golden-section based 
approaches (e.g., [7]) and that admits finite-iteration performance guarantees. In the second scenario, 
we allow at most one combination of user and MCS to be used on any given subchannel and time-slot. 
This scenario occurs widely in practice, such as in the Dedicated Traffic Channel (DTCH) mode of 
UMTS -LTE [8], and results in a mixed-integer optimization problem. Based on a detailed analysis of 
the optimal solution to this problem and its relationship to that in the first scenario, we propose a novel 
suboptimal algorithm that is faster than state-of-the-art golden-section and subgradient based approaches 
(e.g., [7], [9]), and we derive a novel tight bound on the optimality gap of our algorithm. Finally, we 
simulate our algorithms under various OFDMA system configurations, comparing against state-of-the-art 
approaches and genie-aided performance bounds. 

We now discuss related work. The problem of OFDMA downlink SRA under perfect CSI has been 
studied in several papers, notably [10]— [15]. In [10], a utility maximization framework for discrete 
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allocation was formulated to balance system efficiency and fairness, and efficient subgradient-based 
algorithms were proposed. In [11], a subchannel, rate, and power allocation algorithm was developed to 
minimize power consumption while maintaining a total rate-allocation requirement for every user. In [14], 
a weighted-sum capacity maximization problem with/without subchannel sharing was formulated to 
allocate subcarriers and powers. In [15], non-convex optimization problems regarding weighted sum-rate 
maximization and weighted sum-power minimization were solved using a Lagrange dual decomposition 
method. Compared to the above works, we extend the utility maximization framework to imperfect CSI 
and continuous allocations, and propose bisection-based algorithms that are faster for both the discrete 
and continuous allocation scenarios. Unlike [10]— [15], our utility framework can be applied to problems 
with/without fixed rate-power function!]- In additional, it can be applied to pricing-based utilities (e.g., 
responsive pricing and proportional fairness pricing) [2]. Furthermore, we study the relationship between 
the discrete and continuous allocation scenarios, and provide a tight bound on the duality gap of our 
proposed discrete-allocation scheme. 

The problem of OFDMA downlink SRA under imperfect CSI was studied in several papers, notably 
[7], [9], [16], [17]. In [9], the authors considered the problem of discrete ergodic weighted sum-rate 
maximization for user scheduling and resource allocation, and studied the impact of channel estimation 
error due to pilot-aided MMSE channel estimation. In [7], a deterministic optimization problem was 
formulated using an upper bound on system capacity (via Jensen's inequality) as the objective. Both 
optimal and heuristic algorithms were then proposed to implement the obtained solution. Compared to 
these two works, we propose faster algorithms, applicable to a general utility maximization framework 
(of which the objectives in [7], [9] are special cases), under a more general class of channel estimators, 
and for both discrete and continuous subchannel allocations. Our algorithms are inspired by a rigorous 
analysis of the optimal solutions to the discrete and continuous problems. In [16], the problem of total 
transmit power minimization, subject to strict constraints on conditional expected user capacities, was 
investigated. In [17], the effect of heterogeneous delay requirements and outdated CSI on a particular 
discrete resource allocation problem was studied. In contrast, we consider a general utility maximization 
problem that allows us to attack problems that may or may not be based on fixed rate-power functions, 
as well as those based on pricing models. Relative to these works, we propose faster algorithms for both 
continuous and discrete allocation problems with provable bounds on their performances. 

The remainder of this paper is organized as follows. In Section HH we outline the system model and 

'By a "fixed rate-power function" we mean that, for a given SNR, the achievable rate is a known function of the power. 
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frame our optimization problems. In Section [TTlJ we consider the "continuous" problem, where each 
subchannel can be shared by multiple users and rates, and find its exact solution. In Section [TV] we 
consider the "discrete" problem, where each subchannel can support at most one combination of user 
and rate per time slot. In Section |Vj we compare the performance of the proposed algorithms to reference 
algorithms under various settings. Finally, in Section |VIJ we conclude. 

II. System Model 

We consider a downlink OFDMA system with N subchannels and K active users (N, K S Z + ) as 
shown in Fig. [TJ The scheduler-and-resource-allocator at the base-station uses the imperfect CSI to send 
data to the users, across OFDMA subchannels, in a way that maximizes utility. We assume that, for each 
user, there is an infinite backlog of data at the base-station, so that there is always data available to be 
transmitted. During every channel use and across every OFDMA subchannel, the base-station transmits 
codeword(s) from a generic signaling scheme, which propagate to the intended mobile recipient(s) 
through their respective fading channels. For a given user k, the OFDMA subchannels are assumed 
to be non-interfering, with gains that are time-invariant over each codeword duration and statistically 
independent of those for other users. Thus, the successful reception of a transmitted codeword depends 
on the corresponding subchannel's SNR 7, power p, and modulation and coding scheme (MCS), indexed 
by m € {1, ... , M}. We assume that, for user k, MCS m corresponds to a transmission rate of m bits 
per codeword and a codeword error probability of ek, m {pi) = Ojfc.me ,mP7 for known constants ak, m 
and bk, m (see, e.g., [9]). Here, the subchannel SNR 7 is treated as an exogenous parameter, so that pj 
is the effective received SNR. 

To precisely state our scheduling and resource allocation (SRA) problem, some additional notation is 
useful. To indicate how subchannels are partitioned among users and rates in each time-slot, we will use 
the proportionality indicator I n> k m , where I n % m = 1 means that subchannel n is fully dedicated to user 
k at MCS m, and 

In,k,m — means that subchannel n is totally unavailable to user k at MCS m. The 
subchannel resource constraint is then expressed as I n ,k,m < 1 f° r au n - m the sequel, we consider 

two flavors of the SRA problem, a "continuous" one where each subchannel can be shared among multiple 
users and/or rates per time slot (i.e., I n ^,m G [0, 1]), and a "discrete" one where each subchannel can be 
allocated to at most one user/rate combination per time slot (i.e., l n ^,,m £ {0) !})■ We will use p n ^,m > 
as the power that would be expended on subchannel n if it was fully allocated to the user/rate combination 
(k, m). With this definition, the total expended power becomes J2 n k m In,k,mPn,k,m- Finally, we will use 
7 n fc to denote the n th subchannel's SNR for user k. Although we assume that the BS does not know the 
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SNR realizations {^ n ,k}, we assume that it does know the (marginal) distribution of each j n k- 

When subchannel n is fully dedicated to user k with MCS m and power p n ,k,mi tne goodput g n: k,m = 
(1 — afc. m e~ bfc mP " fc m7n fc )r / fc m quantifies the expected number of bits, per codeword, transmitted without 
error. In the sequel, we focus on maximizing goodput-based utilities of the form U n)kjm (g n>k)m ), where 
U n ,k, m (') is an y generic real-valued function that is twice differentiable, strictly-increasing, and concave, 
with £/"n,fc,m(0) < oo. (These conditions imply U' nkm {-) > and U^ km (-) < 0.) In particular, we aim 
to maximize the expected sum utility, E { J2 n k m In,k,mU n ,k,m(9n,k,m)}> where the expectation is taken 
over the subchannel-SNRs {j nk } hidden within the goodputs. Incorporating a sum-power constraint of 
Peon, our SRA problem becomes 

{N K M \ 

s -t- ^ ] In,k,m — 1 and In,k,mPn,k,rn — -fcon- 

k,m n,k,m 

The above formulation is sufficiently general to address a wide class of objectives. For example, to 
maximize sum-goodput, one would simply use J7 n ,fc,mG?) = 9- F° r weighted sum-goodput, one would 
instead choose U n ^^ m {g) = w^g with appropriately chosen weights {w^}. To maximize weighted sum 
capacity w k I n>kjl log (l + p n ,k,ijn,k) , as in [9], one would choose M = a k)1 = b kjl = r k)1 = 1, 
and set U n ^ k) \(g) = w k log (l — log(l — g)) for g G [0, 1). Commonly used utilities constructed from 
concave functions of capacity log(l + p n ,fc,i7n,fc)> sucn as max-min fairness and the utilities in [10] and 
[7], can also be handled by our formulation. For example, the utility U n>ktm (g) = 1 — e~ Wk9 (for some 
positive {w k }) is appropriate for "elastic" applications such as file transfer [3], [4]. Our formulation also 
supports various pricing models [2], such as flat-pricing, responsive pricing, proportional fairness pricing, 
and effective-bandwidth pricing. 

Next, in Section|IIIl we study the SRA problem for the continuous case I n ,k,m £ [0, 1], and in SectionllVl 
we study it for the discrete case I n , k ,m £ {0, 1}. 

III. Optimal Scheduling and Resource Allocation with subchannel sharing 

In this section, we address the SRA problem in the case where l n ^, m £ [0, 1] V(n, k, m). Recall that this 
problem arises when sharing of any subchannel by multiple users and/or multiple MCS combinations 
is allowed. We refer to this problem as the "continuous scheduling and resource allocation" (CSRA) 
problem. Defining I as the N x K x M matrix with (n, k, m) th element as I n ,k,m and the domain of i" 
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as 

Xcsra :={l:le [0,ir xirxM , E fc , m W < 1 Vn}, 
the CSRA problem can be stated as 

CSRA := min - V/„ if:)m E U/ nAm ((l - a Km e~ bk - mPn - k ' mln - k )r Km ) \ s.t. Y] I n ,k,mPn,k,m < ^con- 
{p»,*,m>0} y J n , 

n,k,m n,k,m {Z) 

I 6-t-CSRA 

This problem has a non-convex constraint set, making it a non-convex optimization problem. In order to 
convert it into a convex optimization problem, we write the "actual" power allocated to user k at MCS 
m on subchannel n as x n = I n ,k,mPn,k,m- Then, the problem becomes 

CSRA = min > In±,mFn,k,m\Jn,k,mi%n,k,m) S.t. > X n h m < -fconi (3) 
{x„. fe ,,„>0} V V 
n,k,m n,k,m 

1 6-icSRA 

where F„ )fcjm (-, •) is given by 

_ J-E m ((l - a fe , m e- 6 *^».^».*/ J »-*-)r fe)m )} if J n , fe)m / 

Fn,k,m\In,k,rni •^n,k,m) \ (4) 

I otherwise. 
The modified problem in © is a convex optimization problem with a convex objective function and 
linear inequality constraint. Moreover, Slater's condition is satisfied at I n> k, m = 9 KM an( * x n,k,m = 
^ L In,k,mi Vn, k, m. Hence, the solution of (© is the same as that of its dual problem (i.e., zero duality 
gap) [18]. Let us denote the optimal i" and x for ([3]) by -Z^csra an( ^ ^CSRA' respectively, and let Pq SRA 
be the corresponding p. 

Writing the dual formulation, using \i as the dual variable, the Lagrangian of © is 

L^fJ'jIjX^ — ^ ^ In,k,m Fn,k,m(In,k,mi %n,k,m) ~i~ ( ^ ^ %n,k,m ^"conj (5) 
n,k,m n,k,m 

where we use x to denote the NxKx M matrix [x ni k, m ]- The corresponding unconstrained dual problem, 
then, becomes 

max min L(u,I,x) (6) 

H>Q x^O 

■fG^CSRA 

= max min L(/i, I, x*(u, I)) = maxL(/i, x*(fi, I*(u,))) = L(a*, /*(//*), x*(^*, /*(//*))), 

m>o /eicsRA m>o 

where a; ^ means that x n ^, m > \/n,k,m, x*(fj,,I) denotes the optimal cc for a given /i and I, 
£ ^csra denotes the optimal i" for a given \i, and /x* denotes the optimal \i. 
In the next few subsections, we will optimize the Lagrangian according to © w.r.t. x, I, and fi 
in Section IIII-AI Section IIII-BI and Section IIII-CI respectively. We then propose an iterative algorithm 
to solve CSRA problem in Section IIII-DI Finally, we discuss some important properties of the CSRA 
solution in Section IIII-EI 
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A. Optimizing over total powers, x, for a given [i and user-MCS allocation matrix I 

The Lagrangian in © is a convex function of x. Therefore, any local minimum of the function is a 
global minimum. Calculating the derivative of L(/i, J, x) w.r.t. x nkm , we get 



dL(n, I, x) 



if L 



n,k,m 







dx 



n,k,m 



/^-a fc , m 6 fc , m r fcim E{[/; fcm ((l-a fcim e- f ' fc -^--^.'=/ J --™) ( 7 ) 

otherwise. 



k,m^n.,k,m'fn,k/In 



Clearly, if I n .k,m = 0, then L(-,-,-) is an increasing^ function of x n ,k,m since // > 0. Therefore, 
x n,fc,m(^'- r ) = °- But if I n,k,m / 0, then ^g^ff is an increasing function of x n>fc ,m since U' nkrn {-) is 
a decreasing function of x nikm . Thus, we have 



-bk, m Xn,k,m.'1n,k/ 



In,k, m \ = (8) 



for some positive x nifcjm if and only if < \i < a k>m b k>m r ktm U^ m ((l-a k>m )r k>m ) E{7„ i/t }. Therefore, 

f 

if o < ) E{7n,fc} 



x n,k,m\P J i -0 ~ * 



(9) 







otherwise, 



where x nkm (fj,, J) satisfies 



-bk,min,k,m{lJ;I)jn,k/ In.k,: 



')rk,m)jn 



-bk, m in,k,m,(n,I)jn,k/ In.k,: 



From CG3, we observe that x njkm (/j,,I) = p n k m (fi) I n k , m > where Pn,k,m{l>) satisfies 



M — a k,mbk,m r k,m E {L^ fc m ((l — afe )jn e' 



-bk, m Pn,k,m(l^)'Yn,t 



)rk,m)ln 



fee 



-b k , m p n , k , m (n)'Y-n,k 



Combining the above observations, we can write for any J G 2^csra an d (ra, fc, m) that 

x n,k,m{l L i I) = ^n,k,mPn,k,m(f J ')i 



(10) 



}. (11) 



(12) 



where 



Pn,fc,m(/ i ) = < 



(13) 



Cm) if o < < a k 

) E{7n,fc} 

otherwise, 

and p n ,k,m{p) satisfies (ITTb . Note that if such a p n ,k,m{p) exists that satisfies (TTTI) . then it is unique. 
This is because, in (fTTT) . U' nkm {-) is a continuous decreasing positive function and e^ bk - m ^ n - k - m ^ ln - k 



We use the terms "increasing" and "decreasing" interchangeably with "non-decreasing" and "non-increasing", respectively. 
The terms "strictly-increasing" and "strictly-decreasing" are used when appropriate. 
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Fig. 1 : System model of a downlink OFDMA system with 
N subchannels and K users. Here, n is the subchannel 
index. 
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Fig. 2: Prototypical plot of p* k m (fJ.) as a function of 
fi. The choice of system parameters are the same as those 
used in Section Pvl 



is a strictly-decreasing continuous function of p n ,k,m{lA' which makes the right side of (TTTb a strictly- 
decreasing continuous function of p n fc im (ju). Therefore, in the domain of its existence, p n ,k,m{^) is unique 
and decreases continuously with increase in /i. Consequently, (//,/) is a decreasing continuous 

function of fi. Figure [2] shows an example of the variation of P^kmi^) wxt - f 1 ' 

B. Optimizing over user-MCS allocation matrix I for a given [i 
Substituting x*{n, I) from (ITZb into (O, we get the Lagrangian 

L(fi, I, x*(fi, I)) v n , k , m (W n , k , m M) 



n k,m 



,(14) 



where I n = {I n km V(/c,m)}. Since the above Lagrangian contains the sum of L n (/j,,I n ) over n, 
minimizing L n (fi, I n ) for every n (over all possible I n ) minimizes the Lagrangian. Recall that L n (fj,, I n ) 
is a linear function of {I n ,k,m V(fc, m)} that satisfies J2k I n .k,m — 1- Therefore, L n (fi, I n ) is minimized 
by the J n that gives maximum possible weight to the (k, m) combination with the most negative value 
°f ^n,fc,m(M)Pn fc m(^))- ^° wr i te tms mathematically, let us define, for each \x and subchannel n, a set 
of participating user-MCS combinations that yield the same most-negative value of V^fc m (//,]?* k m (/-t)) 
over all (k, m) as follows: 

s n(fJ>) = \ (k,m) : (k,m) = avgmmV n ^^(fi,p^ k , m ,(fi)), and V n>k;m (iJ,,p* krn (fj,)) < 0\. (15) 

k (k',m') J 
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If S n (/i) is a null or a singleton set, then the optimal allocation on subchannel n is given by 

„ , ^ (l if (k,m)eS n (») 

£,*,»» = S ( 16 ) 

I otherwise. 

However, if |S n (//)| > 1 (where \S n (/i)\ denotes the cardinality of £„(//)), then multiple (k,m) combina- 
tions contribute equally towards the minimum value of L n (p,, I), and thus the optimum can be reached by 
sharing subchannel n. In particular, let us suppose that S n (p,) = {(^i(n),mi(n)), . . . , (fagr^ (n), migr^t (n))}. 
Then, the optimal allocation of subchannel n is given by 



InMnlmJn) if (K m ) = ft(n),m t («)) for some i G {1, . . . , \S n (fJ,)\} 

(17) 



otherwise, 

where the vector (4 ifcl ( n ) imi(n ), . . . , 4,fc |S „ (fl) |(n),m |S „ (M) |(n)) is any point in the unit-(|5„(/i)| - 1) simplex, 
i.e., it belongs to the space [0,1]I s »(m)I and satisfies 

|S»0*)l 

^ ^n,fc(w),m«(n) = 1- (18) 
i=l 

C. Optimizing over // 

In order to optimize over \i, we can calculate the Lagrangian optimized for a given value of [i as 

L{fjL,r^),x*(jjL,r(jjL))) 



-^con+ XI T n,k,M -v{u nAm ((l-a k , m e fefc "" p "' fc - {/i)7 "' fc )r fcim )}+ W ; ifcim (/i) 



n,fc,m 



,(19) 



and then maximize it over all possible values of \i > to find /x*. Notice from (fl6l)-([T8T) that we have 
J2 k m I* k m {^*) = 1 for at least one n. Otherwise, I* (/J,*) = which, clearly, is not the optimal solution. 
Therefore, \i* > /i m j n > 0, where 

fi m \n = min a Km b Km r Km E {U' n k m ((l - a k)m e- bk - mP ^ k )r ktin ) 7 ^ k e- bk ^ P ^- k } (20) 

is obtained by taking p n ,k,m{v) ~^ -Peon for all (n,k,m) in the right side of (ITTb . Since P* nkm {ii) is a 
decreasing continuous function of /x (seen in Section HIl-AI ). we have ^ n km x nk mO^' -0 > ^con for all 
/ / and ^ < jUmi n . We can also obtain an upper bound fj,* < /i m ax> where 

/^max = max a kjm ,mTk,mU nkrn y(l CLk,m)fk,m 

E{ 7n , fc } (21) 

is obtained by taking p n ,k,m 

(fi) — > in the right side of (fTTb - Thus, for any \i > /i ma x, we have that 
x* n km (p>, I) = \/n,k,m, I. Since the primal objective in ([3]) is certainly not maximized when zero 
power is allocated on all subchannels, we have fi* € [/%iiri, ^max] C (0, oo). 
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At the optimal \i, i.e., fx*, if we have \S n (fj,*)\ < 1 Vn, then the optimal CSRA allocation, /qsrA' 
equals J*(/x*) and can be calculated using (fT6l ). Moreover, the optimal power allocation p£ SRA allocates 



Pn,k,m,CSRA 



otherwise 

to every possible (n, k,m) combination. However, if for some n, we have \S n (/j,*)\ > 1, then ambiguity 
arises due to multiple possibilities of I* (//*) obtained via (fTTT ). In order to find the optimal user-MCS 
allocation in such cases, we use the fact that the CSRA problem in ([3]) is a convex optimization problem 
whose exact solution satisfies the sum-power constraint with equality, i.e., 

^ x n,k,m(^* ^* (f 1 *)) = ^n,k,m (A 4 * )P*n,k,m (Z 2 * ) = ^OOn- (23) 

n,k,m n,k,m 

This is because fi* > fi mm > (shown earlier) and the complementary slackness condition gives that 
A**( J2 n km x nk mO"* > (m*) ) — ^con) = 0. Now, recall that the total power allocated to any subchannel n 
at ^* is EEi } ' 4,fc 1 (n),m 1 (n)K,fc ! (n),m ! (n)(/ 1 *) where (n) satisfies CED- This quantity 

is dependent on the choice of values for {/ n ,fci(n),m,(n)}i="i^ ^ an d takes on any value between an upper 
and lower bound given by the following equation: 

minPn^nWn)^*) < 2 / n,fc,(n), m ,(n)K,fc, W , m ,( n )(/i*) < m f x K.fcW,™,^)^*)' ( 24 ) 

Note that the existence of at least one i" = I*(fi*) satisfying 

^ yi 7 n,fci(n.),"H(n) K,fc I (n),m. i (n)(/ J '*) = ^con (25) 
n i 

is guaranteed by the optimality of the dual solution (of our convex CSRA problem over a closed constraint 
set). Therefore, we necessarily have £ n min^^^^/i*) < P con , and £ n maxip* ifct(n)jmi(n) (/i*) > 
P con . In addition, all choices of user-MCS allocations, given by (fTTT ) that satisfy the equality 

En,fc, m / n,fc,m(^*)Pn,fc,m(/^*) = P con, are optimal for the CSRA problem. 

In the case that the optimal solution is non-unique, i.e., \S n (fi*)\ > 1 for some n, then one 

instance of can be found as follows. For each subchannel n, define 

(k max (n,fi*),m max (n,fi*)) := argmaxp* fc . (n)mi(n) (//), (26) 

i 

(fc min (rc,//),m min (n,//)) := argmmp* fc . (n)m . (n) (//), (27) 

i 

and find the value of A G [0, 1] for which 

^(X^^^in^.M^.mminCn.M*)^*)) + ( X ~ ^) ( zC Pn,k max (n,ti* ),m max (n lAt * ) (M* )) = ^con, (28) 

n n 
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i.e., 

(n,/i*)(/^ ) -Peon 

Now, defining two specific allocations, J min (/i*) and I max (fi*), as 

Jl (fc,m) = (k mi „(n,n*),m mn {n,ii*)) J 1 (fc, m) = (fc max (n, /i*), m max (n, /i*)) 

^n,k,m\^ ) — \ J ^n,k,m\^ ) ~ \ 

I otherwise, I otherwise, (30) 

respectively, the optimal user-MCS allocation is given by Iqsra = AI min (^*) + (1 - A)I max (/i*). The 
corresponding optimal power allocation is then given by d22l ). It can be seen that this solution satisfies 
the subchannel constraint as well as the sum power constraint with equality, i.e., 

5^ ^n^mif 1 )Pn,k,m(l L ) = X n,k,m(f 1 )) = -fcon- 

n,k,m n,k,m 

Two interesting observations can be made from the above discussion. Firstly, for any choice of concave 
utility functions U n ^ m {-), there exists an optimal scheduling and resource allocation strategy that allocates 
each subchannel to at most 2 user-MCS combinations. Therefore, when allocating N subchannels, even 
if more than 2N user-MCS options are available, at most 2N such options will be used. Secondly, if 
I mm (/i*) = I mm (fi*), then the exact CSRA solution allocates power to at most one (k,m) combination 
for every subchannel, i.e., no subchannel is shared among any two or more user-MCS combinations. This 
observation will motivate the SRA problem's solution without subchannel sharing in Section [TV] 



D. Algorithmic implementation 

In practice, it is not possible to search exhaustively over \i G [/^min , ^max] ■ Thus, we propose an 
algorithm to reach solutions in close (and adjustable) proximity to the optimal. The algorithm first narrows 
down the location of p,* using a bisection-search over [^ m i n , Mmax] for the optimum total power allocation, 
and then finds a set of resource allocations (I, x) that achieve a total utility close to the optimal. 

To proceed in this direction, with the aim of developing a framework to do bisection-search over p, 
let us define the total optimal allocated power for a given value of p as follows: 

^totO) - ^.roO^ 1 *^)). (31) 

n,k,m 

where I*{p) and x*(p, I*(p)) (defined in Q) minimize the Lagrangian (defined in ©) for a given p. 
The following lemma relates the variation of X* oX (p) with respect to p. 

Lemma 1. The total optimal power allocation, X* oi (p), is a monotonically decreasing function of p. 
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Proof: A proof sketch is given in Appendix |B] For the full proof, see [19]. ■ 
A sample plot of X* oX {fx) and L(fx, I*(fx), x*(fx, I*(fx))) as a function of fx is shown in Figure [3] 
From the figure, three observations can be made. First, as fx increases, the optimal total allocated power 
decreases, as expected from Lemma [TJ Second, as expected, the Lagrangian is maximized for that value 
of fx at which X* oX (fx) = P CO n- Third, the optimal total power allocation varies continuously in the 
region of fx where the optimal allocation, I* (fx), remains constant and takes a jump (negative) when 
I*(fx) changes. This happens for the following reason. We know, for any (n,k,m), that P^kmi^) * s 
a continuous function of fx. Thus, when the optimal allocation remains constant over a range of fx, the 
total power allocated, Yl n km^nk mkl^Pn k m(A i ) a l so varies continuously with fx. However, at the point 
of discontinuity (say jx), multiple optimal allocations achieve the same optimal value of Lagrangian. In 
other words, \S n (jx)\ > 1 for some n. In that case, X* oX (jx) can take any value in the interval 

^2nPn,k min (n),m min (n)(&' ^nP n ,k maK (n),m max {n)^ 

while achieving the same minimum value of the Lagrangian at jx. Applying Lemma [TJ we have 

n n 

for any Ai, Ao > 0, causing a jump of ( V* , r \ , \(jx) — 5~L V* , r \ < in the total 

{n),m m i„in) v > n,K max (n),in max (n) \> >i 

optimal power allocation at jx. 

Lemma [TJ allows us to do a bisection-search over fx since X XoX (fx) is a decreasing function of fx and 
the optimal fx is the one at which X XoX (fx) = P C on- In particular, if fx* G [fi,jx] for some fx and jx, then 
fx* G ^r^A* if X* oi f^pj > Peon, otherwise fx* G fx, . Using this concept, we propose an 
algorithm in Table U that finds an interval [fx,p], such that \x* G [p,jj] and jx — fx < k, where k (> 0) is 
a tuning-parameter, and allocates resources based on optimal resource allocations at fx and jx. 

The following lemma characterizes the relationship between the tuning parameter k and the accuracy 
of the obtained solution. 

Lemma 2. Let fx* 6 [fx, jx] be the point where the proposed CSRA algorithm stops, and the total utility 
obtained by the proposed algorithm and the exact CSRA solution be C/csra(/^> jj) an d ^csra> respectively. 
Then, < U£ SRA ~ ^CSRa(/£, p) < ((1 - /J.)P C on- 

Proof: For proof, see Appendix O ■ 
Since our algorithm stops when jx — fx < k, from Lemma |2j the gap between the obtained utility and 
the optimal utility is bounded by P CO nK. Moreover, lim^-^ Ucsr/k^, jj) = ^csra- 
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TABLE I: Algorithmic implementations of the proposed algorithms 



Proposed CSRA algorithm 



Brute force algorithm for a given J 



1) Set p = {j, min , p = /Umax, and /i = . 

2) For each subchannel n = 1, . . . , N: 

a) For each (k, m), 

i) Use dill ) and d!3t to calculate p£ *, m (/i). 

ii) Use (O to calculate V„,k, m (fM,pZ„k,m(p))- 

b) Calculate S n (p) using J15> . 

3) If (//) | < 1 Vn, then find I* (p.) using <16t , else use 
(3D) and set I*(/x) = I min (At). 

4) Find x*(p,I*(p)) using j 1 2b and calculate Xf oi (p) = 

E, a , m <,t,m(ftJ»)- 

5) If Jf t o t (/x) > P CO n, set fi = n, otherwise set p = p, 

6) If /i — /i > ft, go to step 2), else proceed. 

7) Now we have /ix* £ [/z, /5] and p — fi < k. If X t o t (/z) 7^ 



X t ot(ju), set A 



*tot(jj)- 



else set A = 0. 



8) The optimal user-MCS allocation is given by Jcsra — 
XI*(p) + (1 — A)7*(/tt) and the corresponding opti- 
mal x is given by x C sra = Xx*(p, I*(p)) + (1 — 
\)x* (n, I* (fi)). The optimal power allocation, p C SRA' 
then can be found using 

m.CSRA ;f f i n 

7" 11 1 n,k,m, CSRA 7^ U ,™ 

i,CSRA='i 7 n,fc, m ,CSRA (32) 

otherwise, 



where /„, fc , m ,csRA 



and £„, fc , m ,csRA denote the 
(n, fc, m) th component of icsRA and aicSRA> respec- 
tively. Notice that the obtained solution satisfies the 
sum-power constraint with equality. 



1) 
2) 
3) 
4) 
5) 
6) 

7) 



Initialize p = ^ min and p = fi max . 
Set /i = =2~- 

For each (n, fc, m), use J39t-J4 1 1> to obtain x* n h m (p). 
Find X* ot (I,n) using f42t . 

If X t o t (J, fi) > P con , set ^ — (x, otherwise set p — p. 
If p — fj, < k, go to step 7), otherwise go to step 2). 

x t * t (j, M )-_p con 



If X t * t (/,/x) ^ X t * ot (/,£i), set A 
otherwise set A = 0. 
Set pi — p. The best actual power allocation is given 
by xi — Xx*(p) + (1 — X)x*(fi) and the best power 
allocation, p { , is given by 



Pn,k : m,I 







if In,k,m 7^ 

otherwise, 



where p n ,k, m ,i and x n ,k,m,i are the (n,k,m) th ele- 
ment of pj and xi, respectively. The corresponding 
Lagrangian, found using Li — Lj(/2, p*(/x)), gives the 
optimal Lagrangian value. 



Proposed DSRA algorithm 



1) Use the algorithmic implementation of the proposed 
CSRA solution in to find I*(p) and I* (p.), where the 
optimal /i for the CSRA problem, i.e., ^1* lies in the set 
[p_, p], p - p < k, and I* (fi) , I* (p) £ Xdsra- 

2) For both 1 = 1* (p) and 1 = 1* (p) (since they may 
differ), calculate pj and Lj as described for the brute 
force algorithm. 

3) Choose /dsra = argmin Jg{/ , (ii)i as the 
user-MCS allocation and Pdsra = P/ DSRA as the as- 
sociated power allocation. 



The proposed algorithm requires at most [log 2 ( / ' max ~ Atmin )] iterations of fi in order to find /2, and \i 
such that j2 — fi<K and //* 6 [/i, p]. Therefore, measuring the complexity of the algorithm by the number 
of times (fTTT) must be solved for a given (n,k,m, fj,), the proposed algorithm takes at most 



NKM \\og 2 ( *""~ /in *' )l 



(33) 



steps. We use this method of measuring complexity because it allows us to easily compare all algorithms 
in the paper. Note that, for a given k, the number of steps taken by the proposed bisection algorithm is 
proportional to log 2 n. 



June 30, 2011 



DRAFT 



14 



E. Some properties of the CSRA solution 

In this subsection, we study a few properties of the CSRA solution that yield valuable insights into 
the optimal resource allocation strategy for any given value of Lagrange multiplier, 11. Let us fix a 
A G [/Wmini ^max]- Now, if \S n (fl)\ < 1, Vn, then the optimal allocation at fi, I*(fi), is given by (fT6l ). which 
reveals that !*(/}) G {0, l}^*^*^ ^ ^ 

case, the definition of Zqsra implies that every subchannel 
is allocated to at most one user-MCS combination. Note that this is precisely the constraint we impose 
in the later part of this paper. Let us now consider the case where it is possible that \S n (jl)\ > 1 for 
some n. 

Lemma 3. For any p, > 0, there exists a 5 > such that for all u G (jl — 5, jl + 5) \ {jl}, there exists an 
optimal allocation, I*(ll) G Zcsra. that satisfies I*(p) G {0, \} NxKxM _ Moreover, if p\,pi G (jl — 5,jl), 
then there exists I* (lii) , I* (112) G {0, \} NxKxM such that I*(fi\) = I*{ii2)- The same property holds 
if both Lii,fi2 G (jl, ft + 5). 

Proof: A proof sketch is given in Appendix |Pl For the full proof, see [19]. ■ 
In conjunction with (IT2b . the above lemma implies that the discontinuities in Fig. [3] are isolated and 
that, around every point on the horizontal axis, there is a small region over which X^ oX (fi) is continuous. 
Hence, the number of such discontinuities are, at most, countable. 

IV. Scheduling and Resource Allocation without subchannel sharing 

In this section, we will solve the scheduling and resource allocation (SRA) problem CQ) under the 
constraint that I n ,k,m G {0, 1}, i.e., that each subchannel can be allocated to at most one combination 
of user and MCS per time slot. We will refer to this problem as the "discrete scheduling and resource 
allocation" (DSRA) problem. Storing the values of I n ^k,m in the N X K x M matrix i", the DSRA 
subchannel constraint can be expressed as / G Idsra> where 

Xdsra := {/ : J G {0, 1}"***", £ l nAm < 1 Vn} . 

k,m 

Then, using (OQ), the DSRA problem can be stated as 

DSRA := max V] 4,fc,m E I U nAm ((l - a/ c>m e" fefc " lP " fc - m7 " ' t )r fc>rn ) i s.t. V] In,k,mPn,k,m < p con- 

{fVfc,„«>o} 1 — 4 y > (r ..^ 

n,k,m n,k,m V*^) 

1 6-f-DSRA 



Let us denote the optimal I and p for (1341 ) by -^dsra an< ^ Pdsra' respectively. 

The DSRA problem is a mixed-integer programming problem. Mixed-integer programming problems 
are generally NP-hard, meaning that polynomial-time solutions do not exist [20]. Fortunately, in some 
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cases, such as ours, one can exploit the problem structure to design polynomial-complexity algorithms 
that reach solutions in close vicinity of the exact solution. We first describe an approach to solve the 
DSRA mixed-integer programming problem exactly by exhaustively searching over all possible user-MCS 
allocations in order to arrive at the optimal user, rate, and power allocation. We will see that this "brute- 
force" approach has a complexity that grows exponentially in the number of subchannels. Later, we will 
exploit the DSRA problem structure, and its relation to the CSRA problem, to design an algorithm with 
near-optimal performance and polynomial complexity. 

A. Brute-force algorithm 

Consider that, if we attempted to solve our DSRA problem via brute-force (i.e., by solving the power 
allocation sub-problem for every possible choice of I G 2^dsra)> we would solve the following sub- 
problem for every given i". 

max In,k,m E \ fn,fc,m,((l — Ofc.mC bk ' mP ' 1 " k - m ' t " • k )rk, m ) \ S.t. \j In,k,m Pn,k,m < -Peon- 

n,k,m n,k,m (-5-5) 

Borrowing our approach to the CSRA problem, we could transform the variable p nj k,m into x n ,fc,m via 
the relation: x n fc m = I n ,k,mPn,k,rn- The problem in d35l ) can, therefore, be written as: 

min 7 In,k,m Fn,k,m\In,k,rni %n,k,m) S.t. 7 %n,k,m — -Pcori) (36) 

{x n , k . m >0} ^ , 

n,k,m n,k,m 

where F n> k : m{In,k,m, £n,fc,m) is defined in dU). This problem is a convex optimization problem that satisfies 
Slater's condition [18] when x n ^, m = P C on/2NKM for all n,k,m. Therefore, its solution is equal to 
the solution of its dual problem (i.e., zero duality gap) [18]. To formulate the dual problem, we write 
the Lagrangian of the primal problem (|36l ) as 

Z//(/i,, a?) — ^ ^ In,k,m,Fn,k,m(In,k,rm %n,k,m) ~\~ ^ ^ %n,k,m -fcon^M' (37) 

n,k,m n,k,m 

where fi is the dual variable and x is the N x K x M matrix containing actual powers allocated to all 
(n, k, m) combinations. Note that the Lagrangian in (1371 ) is exactly the same as the Lagrangian for the 
CSRA problem in ©. Using (f37T >. the dual of the brute-force problem can be written as 

maxminLj(/i, x) = maxL;(/i,a;*(/i)) = Li(fi* I ,x*(/i* I )), (38) 

for optimal solutions fij and Minimizing Lr(/x, x) over {x y 0} by equating the differential of 

Li(/i,x) w.r.t. x nj k : m to zero (which is identical to the approach taken in Section UlI-AI for the CSRA 
problem), we get that, for any subchannel n, 

<,fe,m(M) = J n,fe,mPn,fc,m(M)- (39) 
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Here, 

( 

Pn,k,m 

(fi) if < fi < a k 

) E{7n,fc} 



Pn,k,m{n) 



(40) 

otherwise, 



and pn,k,m{fJ-) is the uniqudj value satisfying CfTTT) . repeated as (|4TT) for convenience. 

/u = a Km b Km r Km V{U' nAm ({l - a fcim e- bfc -^^--^"' fc )r fcim ) 7nifc e- bfc -P- fc "-^"' fc }. (41) 

Note that the Lagrangian as well as the power allocation in (I37T ) and (l39l are identical to that obtained 
for the CSRA problem in © and (fl2l . respectively. Also recall that (fT9T)-(l2Tb hold even when I*(fi) is 
replaced by arbitrary I. Thus, we have fj,} £ [/U m j n , /x max ], where ^ m j n and /x m j n are defined in (l20l and 
(|2TT >. respectively. 

As discussed in Section ITlI-AI p n ,k,m{^) is a strictly-decreasing continuous function of /i, which makes 
Pn k mil 1 ) a decreasing continuous function of \i. Let us now define 

^tOt( J )^) - Y X n,k,m(v) = Y J n,*."»Pn,fc,n»(A*) ( 42 ) 
n,k,m n,k,m 

as the total optimal power allocation for allocation i" at fx. Therefore, X* oi (I, //) is also a decreasing 
continuous function of |U. This reduces our problem to finding the minimum value of \i € [/U m in j /^max] 
for which X t * t (J, //) = P C on- Such a problem structure (i.e., finding the minimum Lagrange multiplier 
satisfying a sum-power constraint) yields a water-filling solution (e.g., [9], [21]). To obtain such a solution 
(in our case, one can use the bisection-search algorithm given in Table U 

While there are many ways to find \i, we focus on bisection-search for easy comparison to the CSRA 
algorithm. Then, to solve the resource allocation problem for a given i" £ Zdsra> the complexity, 
in terms of the number of times d4T~b (or CCD) is solved to yield jlj such that — fi>}\ < k, is 
( En,fc,m \ lo ^2 ( ^ max / mi " )J . Since the brute-force algorithm examines |X DSRA | = (KM + 1) N hy- 

potheses of i", the corresponding complexity needed to find the exact DSRA solution is [log 2 ( /Jmax ^ Mmin )] x 
En=i n {n)( KM T or > equivalently, 

^l og2 ^max-^min ^ x ^ RM + \)N-l NRM _ ( 43 ) 

Because this "brute-force" algorithm may be impractical to implement for practical values of K, M, 
and N, we focus, in the sequel, on lower-complexity DSRA approximations. In doing so, we exploit 
insights previously gained from our study of the CSRA problem. 

3 By assumption, U' n k,m{') ls a decreasing positive function and e~ 6fc m ^"' fc m '' l ' 7 " ' ,: is a strictly-decreasing positive function 
of p n ,k,m(^), which makes the right side of J4U a strictly-decreasing positive function of p n ,k,m(n)- 
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B. Proposed DSRA algorithm 

Equation ( f3QT > in Section IIII-BI demonstrated that there exists an optimal user-MCS allocation for 
the CSRA problem that either lies in the domain of DSRA problem, i.e., I* (p*) G Zdsra> or is a 
convex combination of two points from the domain of DSRA problem, i.e., I*(p-*) = XI mm (p*) + (1 — 
A)/ max (/i*), where I min (fi*) + J max (^*) and r in (fi*), I m!a (fi*) G X DSRA - (Note that if J G X CS ra and 
I G {0, \} NxKxM j then I G Zdsra-) This observation motivates us to attack the DSRA problem using 
the CSRA algorithm. In this section, we provide the details of such an approach. 

The following lemma will be instrumental in understanding the relationship between the CSRA and 
DSRA problems and will serve as the basis for allocating resources in the DSRA problem setup. 

Lemma 4. If the solution of the Lagrangian dual of the CSRA problem (J6]) for a given p, is such that 
I*(p) G {0, \} NxKxM t an( l the corresponding total power is X* oi (p) as in d^il ), then the solution to the 
optimization problem 

(P*,I*) = argmax V I n , fe , m E {u nAm {(l - a Km e- b ^ v ^^)r Km ) } s.t. V l n ,k,rr^n,k,m < ^totM 

L — ' n,k,m n,k,m 
Jg^dsra 



satisfies I* = I*(p) and, for every (n, k, m), 



n,k,m 



I^JJT) ^ I n,k,m^> ^ U 

otherwise. 



Proof: A proof sketch is given in Appendix |E] For the full proof, see [19]. ■ 
From the above lemma, we conclude that if a ^ exists such that I*(p) G Zdsra an d ^mil 1 ) = -fcon, 
then the DSRA problem is solved exactly by the CSRA solution (I*(p),x*(p,I*(p))), i.e., the optimal 
user-MCS allocation -f DS RA ec l ua l s ^*(m) an d trie optimal power allocation, P DS ra> f° r an y ( n ' ^> m )> i s 

Pn,k,m, DSRA ~ \ 

I otherwise. 

Recall that the optimal total power achieved for a given value of Lagrange multiplier p, i.e., Xf ot (p) = 
Z^n km x nk mil 1 ' (,"))> i s piece-wise continuous and that a discontinuity (or "gap") occurs at p when 
multiple allocations achieving the same optimal value of Lagrangian exist. When the sum-power con- 
straint, P C on, lies in one of those "gaps," the optimal allocation for the CSRA problem equals a convex 
combination of two elements from the set Idsra> an d the CSRA solution is not admissible for DSRA. 
In such cases, we are motivated to choose the approximate DSRA solution Jdsra £ {-^"""(m), -^"(a 4 )} 
yielding highest utility. In Table H we detail an implementation of our proposed DSRA algorithm that 
has significantly lower complexity than brute-force. The numerical simulations in Section [V] show that its 



June 30, 2011 



DRAFT 



18 

performance is very close to optimal. Moreover, the following lemma bounds the asymptotic difference 
in utility achieved by the exact DSRA solution and that produced by our proposed DSRA algorithm. 

Lemma 5. Let // be the optimal [i for the CSRA problem and p,p be such that fi* E \p,p]- Let Uq SR/k 
and C/dsra(/U, P) be the utilities achieved by the exact DSRA solution and the proposed DSRA algorithm, 
respectively. Then, 

< ^DSRA " I™ #DSRa(a*,£) < " A%in)(^con " X£,(I m "V), /i*)) (45) 

if \S n {u*)\ < 1 Vn 

< < • (46) 

(^max ~ A%iin)-fcon otherwise 

Proof: The proof is given in Appendix |F] ■ 
It is interesting to note that the bound (l46l ) does not scale with number of users K or subchannels N. 
The complexity of the proposed DSRA algorithm is marginally greater than that of the CSRA algorithm, 

since an additional comparison of two possible user-MCS allocation choices is involved. In units of solving 

(fTTT) for a given (n, k,m, p), the DSRA complexity is at most 

N(KM + 2) [log 2 ( agg=an!n )] . (47) 

Comparing d43l ) and (|47l ), we find that the complexity of the proposed DSRA algorithm is polynomial 
in N, K, M, which is considerably less than that of the brute-force algorithm (i.e., exponential in N). 



C. Discussion 

Before concluding this section, we make some remarks about our approach to DSRA and its connections 
to CSRA. First, we note that the DSRA problem is an integer-programming problem due to the discrete 
domain {0, 1} assumed for I n ^, m - Because integer programming problems are generally NP-hard (recall 
our "brute force" DSRA solution), one is strongly motivated to find a polynomial-complexity method 
whose performance is as high as possible. One possible approach is based on "relaxation," whereby 
the discrete domain is relaxed to an interval domain, the relaxed problem is solved (with polynomial 
complexity), and the obtained solution is mapped back to the discrete domain. In fact, relaxation was 
previously employed for OFDMA frequency-scheduling in [9], [11], and the DSRA approximation that 
we propose in Section IIV-BI can also be interpreted as a form of relaxation. 

The optimization literature suggests that relaxation is successful in some — but not all — cases, implying 
that relaxation-based OFDMA algorithms must be designed with care. For example, relaxation has widely 
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used to solve linear integer programs (LIPs) [22]-[24]. The DSRA, however, is a mixed-integer nonlinear 
program (MINLP), and for such problems relaxation does not always perform well [24], [25]. Now, one 
could cite the analysis in [26, p. 371], which shows that — for a broad class of integer programming 
problems — the duality gap goes to zero as the number of integer variables goes to infinity, to suggest 
that the DSRA problem can be well approximated by its relaxed counterpart, CSRA, as the number of 
OFDMA subchannels N — > oo. However, in practice, the number of subchannels N is often quite small, 
preventing the application of this argument. For example, in LTE systems [6], [27], each subchannel 
consists of 12 subcarriers, so that only 25 subchannels are used for 5 MHz bandwidths, and only 6 are 
used for 1.4 MHz bandwidths. 

The above considerations have motivated us to investigate, in detail, the relationship between the 
continuous and discrete resource allocation scenarios. The results of our investigation include insights 
into the dissimilarity between CSRA and DSRA solutions (e.g., Lemma [3] and Lemma HJ), and an efficient 
polynomial-complexity DSRA approximation that (as we shall see in Section |V) performs near-optimally 
for all N and admits the tight performance bound d46b . 

V. Numerical Evaluation 

In this section, we analyze the performance of an OFDMA downlink system that uses the proposed 
CSRA and DSRA algorithms for scheduling and resource allocation under different system parameters. 
Unless otherwise specified, we use the sum-goodput utility U ni k, m (g) = g- 

For downlink transmission, the BS employs an uncoded 2 m+1 -QAM signaling scheme with MCS index 
m € {1, . . . , 15}. In this case, we have r*. m = m + 1 bits per symbol and one symbol per codeword. In 
the error rate model efc,m(P7) = o-k,m^~ bk,mP1 , we choose (ik, m = 1 and bk, m = 1.5/(2 m+1 — 1) because 
the actual symbol error rate of a 2 m+1 -QAM system is proportional to exp(— 1.5p7/(2 m+1 — 1)) in the 
high-(j>7) regime [28] and is « 1 when jry = 0. We use the standard OFDM model [29] to describe the 
(instantaneous) frequency-domain observation made by the k th user on the n th subchannel: 

Un,k = h n> kX n + u njk , for n G {1, ... ,N} and k € {1, . . . ,K} (48) 

In (|48l , x n denotes the QAM symbol broadcast by the BS on the n th subchannel, h n & the gain of 
the n th subchannel between the k th user and the BS, and z/ n ,ifc a corresponding complex Gaussian 
noise sample. We assume that {V n ,/fc} is un it variance and white across (n,k), and we recall that 
the exogenous subchannel-SNR satisfies 7„ ^ = \h n ^\ 2 . We furthermore assume that the k th user's 
frequency-domain channel gains = (foi,fc> • • • > ^v,fc) T £ are related to its channel impulse response 
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9k = (di, kt ■ ■ ■ j 9L,k) T ^ C L via h k = Fg k , where F £ C NxL contains the first L(< iV) columns 
of the iV-DFT matrix, and where {gi t k} are i.i.d. over (l,k) and drawn from a zero-mean complex 
Gaussian distribution with variance a 2 g chosen so that E{7„ ^} = 1. Since the total available power for 
all subchannels at the base-station is P C on> the average available SNR per subchannel will be denoted by 
SNR = ^E{ 7n , fc }. 

To model imperfect CSI, we assume that there is a channel-estimation period during which the mobiles 
take turns to each broadcast one pilot OFDM symbol, from which the BS estimates the corresponding 
subchannel gains. Furthermore, we assume that the channels do not vary between pilot and data periods. 
To estimate h k , we assume that the BS observes y k = y'Ppiiot + u k 6 C^. Note that the average 
SNR per subchannel under pilot transmission is SNR p j| 0t = p p j| 0t ~Ej{^ nk }. The channel h k and the 
pilot observations y k are zero-mean jointly Gaussian, and furthermore h k \y k is Gaussian with mean 
V{h k \y k } = R hkt y k R^y k y k and covariance Cow{h k \y k ) = R hk , hk - R hki y k R^ yk Ry khk , where 
Rz u z 2 denotes the cross-correlation of random vectors Z\ and Z2 [30, pp. 155]. Since Rh k ,h k = a 2 FF' ', 
Rh k ,y k = ^PpWo^gFF' , and Ry kt y k = pp\\ \a 2 FF' + I (where I denotes the identity matrix), it 
is straightforward to show that the elements on the diagonal of Cov{h k \y k ) are equal. Furthermore, 
E{h k \y k } can be recognized as the pilot-aided MMSE estimate of h k . In summary, conditioned on 
the pilot observations, h n ^ k is Gaussian with mean h n ^ given by the n th element of E{h k \y k }, and 
with variance a 1 given by the first diagonal element of Cov(h k \y k ). Thus, conditioned on the pilot 
observations, 7„ & has a non-central chi-squared distribution with two degrees of freedom. 

We will refer to the proposed CSRA and DSRA algorithms implemented under imperfect CSI as 
"CSRA-ICSi" and "DSRA-ICSI," respectively. Their performances will be compared to that of "CSRA- 
PCSI," i.e., CSRA implemented under perfect CSI, which serves as a performance upper bound, and 
fixed-power random-user scheduling (FP-RUS), which serves as a performance lower bound. FP-RUS 
schedules, on each subchannel, one user selected uniformly from {1, . . . , K}, to which it allocates power 
Pcon/iV and the fixed MCS m that maximizes expected goodput. Unless specified, the number of OFDM 
subchannels is N = 64, the number of users is K = 16, the impulse response length is L = 2, the average 
SNR per subchannel is SNR = 10 dB, the pilot SNR is SNR pi | 0t = -10 dB, and the DSRA/CSRA tuning 
parameter is k = 0.3/P con (recall Table J]). In all plots, goodput values were empirically averaged over 
1000 realizations. 

Figure [4] plots the subchannel-averaged goodput achieved by the above-described scheduling and 
resource-allocation schemes for different grades of CSI. In this curve, SNR p j| 0t is varied so as to obtain 
estimates of subchannel SNR with different grades of accuracy. All other parameters remain unchanged. 
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Fig. 3: Prototypical plot of -X^tot (m) an d 
i(/x, I* (fi),x* (fj., I* (fJ,))) as a function of ji for 
N = K = 5, and P C on = 100. (See Section |V] for details.) 
The red vertical lines in the top plot show that a change 
in occurs at that p. 
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Fig. 4: Average goodput per subchannel versus SNR P j| t. 
Here, N = 64, K = 16, and SNR = 10 dB. 



The plot shows that, as SNR p j| 0t is increased, the performance of the proposed schemes (under the 
availability of imperfect CSI) increases from that of FP-RUS to that achieved by CSRA-PCSI. This 
is expected because, with increasing SNR p j| 0t , the BS uses more accurate channel-state information 
for scheduling and resource allocation, and thus achieves higher goodput. The plot also shows that, 
even though the proposed CSRA algorithm exactly solves the CSRA problem and the proposed DSRA 
algorithm approximately solves the DSRA problem, their performances almost coincide. In particular, 
although the goodput achieved by CSRA-ICSI scheme exceeded that of DSRA-ICSI scheme in up-to 49% 
of the realizations, the maximum difference in the subchannel-averaged goodput was merely 4 x 10~ 3 bits 
per channel-use (bpcu). Since the DSRA-ICSI schemes cannot achieve a sum-goodput higher than that 
achieved by the CSRA-ICSI scheme, it can be deduced that the proposed DSRA algorithm is exhibiting 
near-optimal performance. 

Figure [5] plots the subchannel-averaged goodput versus the number of available users, K, ranging 
between 1 and 32. It shows that, as K increases, the goodput per subchannel achieved by the proposed 
schemes increase under both perfect and imperfect CSI, whereas that achieved by the FP-RUS scheme 
remains constant. This is because, in the former case, the availability of more users can be exploited 
to schedule users with stronger subchannels, whereas, in the FP-RUS scheme, users are scheduled 
without regard to the instantaneous channel conditions. Similar to the observations in the previous plots, 
the performance difference between the proposed CSRA and DSRA algorithms remains negligible. In 
particular, although the goodput achieved by CSRA-ICSI exceeded that of DSRA-ICSI in up-to 29% of 
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Fig. 5: Average goodput per subchannel versus number 
of users, K. In this plot, N = 64, SNR = 10 dB, and 
SNRpiio, = -10 dB. 
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Fig. 6: The top plot shows the average goodput per 
subchannel as a function of SNR. The bottom plot shows 
the average bound on the optimality gap between the 
proposed and exact DSRA solutions (given in l !45t ). i.e., 
the average value of (ji* — p m m)(P C on — X^{I mm , fi*))/N. 
In this plot, N = 64, K = 16, and SNR pi | 0t = -10 dB. 



the realizations, the maximum difference in the subchannel-averaged goodput was merely 7 x 10~ 4 bpcu. 

In Figure [6j the top plot shows the subchannel-averaged goodput and the bottom plot shows the 
subchannel and realization-averaged value of the bound (l45l ) on the DSRA-ICSI optimality gap as 
a function of SNR. In the top plot, it can be seen that, as SNR increases, the difference between 
CSRA-PCSI and CSRA-ICSI (or, DSRA-ICSI) increases. However, the difference grows slower than the 
difference between CSRA-PCSI and FP-RUS. Interestingly, even for high values of SNR, the performance 
of CSRA-ICSI and DSRA-ICSI remain almost identical. In particular, although the goodput achieved by 
CSRA-ICSI scheme exceeded that of DSRA-ICSI scheme in up-to 28% of the realizations, the maximum 
difference in the subchannel-averaged goodput was merely 4 x 10~ 5 bpcu. The bottom plot, which 
illustrates the average value of (//* — /%iin)(-fcon — -^totC^™") A 4 *)) over au realizations and subchannels 
w.r.t. SNR, shows that the loss in sum-goodput over all subchannels due to the sub-optimality of proposed 
DSRA solution under imperfect CSI is bounded by 7 x 10~ 3 bpcu, even when the subchannel-averaged 
goodput of DSRA-ICSI is of the order of tens of bpcu. These results confirm that the bound d45l ) is quite 
tight at high SNR. 

Figure [7] shows the performance of the proposed DSRA algorithm under a sum-utility criterion that 
is motivated by a common pricing model for an elastic application such as file-transfer [3], [4]. In 
particular, we partitioned the K = 16 users into two classes: k G {1, . . . , 8} = KL\ is "Class 1" and 
k € {9, . . . , 16} = JC 2 is "Class 2," and we ran DSRA with the utility U k (g) = (1 - e^ 19 )!^, + 
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Fig. 7: The top plot shows sum utility versus Wi when 
W2 — 1, SNR = dB. The bottom plot shows the sum- 
utility versus SNR when wi = 0.85, 11)2 = 1. Here, N = 
64, K = 16, and SNR pi | 0t = -10 dB. 
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Fig. 8: The top plot shows the mean deviation of the 
estimated dual variable /j, from (j,*, and the bottom plot 
shows average sum-utility, as a function of the number of 
^-updates. Here, N = 64, K = 16, SNR = 10 dB, and 
SNRpii ot = -10 dB. 



(1 — e~ W29 )lk£K. 2 > where \g denotes the indicator of event 6. The utility can be regarded as the revenue 
earned by the operator: when wi > Wj , Class-i users pay more (for a given goodput g) than Class- j users 
in exchange for priority service. In Fig. [71 we show the resulting DSRA-maximized utility summed over 
all users, as well as that summed over each individual user class. For comparison, we show the utility 
(summed over all users) when DSRA is "naively" used to maximize sum-goodput instead of sum-utility. 
The top plot in Fig. [7J shows performance as a function of w±, for fixed W2 = 1 and SNR = dB. There 
the behavior is as expected: when wi < W2 = 1 (i.e., Class- 1 users pay much less) DSRA allocates the 
overwhelming majority of the resources to Class-2 users, in an effort to earn more revenue. Meanwhile, 
when w\^> 11J2 = 1, the overwhelming majority of resources are allocated to Class- 1 users. Moreover, it 
is evident that the naive goodput-maximizing scheme does not earn the operator as much revenue as the 
utility-maximizing scheme (outside of the trivial case that w\ = W2). The bottom plot in Fig. [7J shows 
the above described sum-utilities as a function of SNR, for fixed w\ = 0.85 and u? 2 = 1. There it can 
be seen that, at low SNR, the two classes achieve proportional utilities while, at high SNR, the utility 
of Class-1 users tend to zero. This behavior can be explained as follows: At low SNR, the goodputs g 
are small, in which case 1 — e~ Wi9 w wig, so that Uk{g) ~ Wiglk^K^ i- e -> weighted-goodput utility. At 
high SNR, this approximation does not hold because the goodputs g are usually large, and this particular 
pricing-based utility becomes increasingly unfair. 

In Figure [8j we compare the performances of our proposed algorithms to the state-of-the-art algorithms 
in [7], [9]. In particular, we first compare the golden-section-search based algorithm from [7] to our CSRA 
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algorithm. For CSRA, we choose the utility function and the SNR distributions to maximize the upper 
bound on capacity computed via the effective SNR -4 t,loe (l + -? — , I !"' k '^ h " ,k \t — r^) from \1 , Eq. 
(4)]. Second, we compare the subgradient-based algorithm proposed for discrete allocation in [9] to our 
DSRA algorithm. For DSRA, we choose the utility U n ^, m {g) = -w log(l — log(l — g)) Vn, k, m, so that 
we maximize ^ n fc E{log(l + p n ,k,ljn,k)}> as in [9]. The top plot in Fig. [8] shows the mean deviation 
of the estimated value of the dual variable // from the optimum (i.e., //*), and the bottom plot shows 
the total utility achieved as a function of the number of //-updates. For the subgradient-based algorithm 
in [9], we set the step-size in the / th //-update to be In the top plot, it can be seen that the proposed 
algorithms outperform the algorithms in [7], [9] and converge toward //* at a much faster rate. The bottom 
plot shows that the proposed algorithms achieve a much higher utility than the algorithms in [7], [9] for 
the first few //-updates, illustrating the speed of our approaches. Note that the golden-section algorithm 
only provides estimates of //* at even numbers of //-updates. 

VI. Conclusion 

In this paper, we considered the problem of joint scheduling and resource allocation (SRA) in downlink 
OFDM A systems under imperfect channel-state information. We considered two scenarios: 1) when 
subchannel sharing is allowed, and 2) when it is not. Both cases were framed as optimization problems 
that maximize a utility function subject to a sum-power constraint. Although the optimization problem 
in the first scenario (the so-called "continuous" or CSRA case) was found to be non-convex, we showed 
that it can be converted to a convex optimization problem and solved using a dual optimization approach 
with zero duality gap. An algorithmic implementation of the CSRA solution was also provided. The 
optimization problem faced in the second scenario (the so-called "discrete" or DSRA case) was found 
to be a mixed-integer programming problem. To attack it, we linked the DSRA problem to the CSRA 
problem, and showed that, in some cases, the DSRA solution coincides with the CSRA solution. For 
the case that the solutions do not coincide, we proposed a practical DSRA algorithm and bounded its 
performance. Numerical results were then presented under a variety of settings. The performance of the 
proposed CSRA and DSRA algorithms schemes under imperfect CSI were compared to those under 
perfect CSI and no instantaneous CSI (i.e., fixed-power random scheduling). In all cases, it was found 
that the proposed imperfect-CSTbased algorithms offer a significant advantage over schemes that do not 
use instantaneous CSI. Next, our DSRA bound was numerically evaluated and found to be extremely 
tight. We then demonstrated an application of DSRA to maximization of a pricing-based utility. Finally, 
our CSRA and DSRA algorithms were compared to the state-of-the-art golden-section-search [7] and 
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subgradient [9] based algorithms and shown to yield significant improvements in convergence rate. 
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Appendix A 

Sketch of proof for convexity of CSRA problem 

First, we show that I n ,k,mF n ,k,m{In,k,m, x n ,k,m) is convex in I ni k, m and x n>k , m - For this, consider the 
case when I nAm > 0. In this case, the Hessian of I n ,k,mF n ,k,m(In,k,m, Xn,k,m) w.r.t. I n ^ m and x nAm 
can be calculated and found to be positive semi-definite. Next, consider the case when I n ^, m = 0. To 
prove convexity in this case, we apply the definition of convexity, i.e., for any two points \ m' x nk m) 

(2) (2) 

and (I n km ,x n k m ) in the domain of CSRA problem and for any A G [0, 1], convexity means 

xrW p (ji 1 ) J 1 ) Un_\u( 2 ' f fr® -® 

A1 n,k,m r n,k,m I 1 n h,m ' x n,k,m j <~ V 1 A ) 1 n,k,m r n,k,m I 1 n ,k,m ' x n,fc,m 



n,k,m ' n,k,m 



'^nim + (! - WSJ F nAm {^ k , m + (1 " A) 1®^ , Xx^a + (1 - \)x®^) . (49) 

When one or both of \ m , 1^1 m) are zero > it i s straightforward to show that the above equation holds. 
Therefore, I n ,k,mF n ,k,m{In,k,m, x n ,k,m) is convex in I n ,k, m and x n>kjm . Consequently, it is a convex func- 
tion of i" and x. Since the primal objective function of the CSRA problem J2 n k m ^n,k,m Fn,k,m(In,k,m, x n,k 
is a sum of functions that are convex in / and x, it is also convex in I and x. 
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Appendix B 
Sketch of proof of Lemma Q] 

Suppose that \i\ < /12, where Mi>M2 £ [Mmin > Mmax] ■ With /i fixed, the minimization problem becomes 
= min L(ji,I,x) = min ( V] x n fc m - P con ) M + V] I n ,k,mF n k m (I n k m ,x n k m ) (50) 

n,k,rn n,k,m 

1 Gi-CSRA -« Gi-CSRA 

recalling ©. At m = I*{^2) and x*(fi2, i"*(M2)) are suboptimal values of and a?*(M, /*(//)), 

and at m = M2> and x*(mi, /""(mi)) are suboptimal values of /*(m) and x* (//,/* (//)). Therefore, 

L(Mi,I* (mi), x*(mi, I* (mi))) < L(Mi,r (M2), **(m2, i"*(/i 2 ))), and (51) 

l(m2, r (/i 2 ), x*(M2, r (M2))) < ^(/i 2) r (mi), x*(mi,.p(mi)))- (52) 

Adding (BIT) and d52l) . and evaluating the result, we get 

(Ml - M2) ( XI ^Am^l' r (^l)) ~ <,fc,m.(^2, I*(M2))) < 0- (53) 

Since mi < M2> we have X t * t (Mi) > X t * t (M2)- Therefore, X* oi (/j,) is monotonically decreasing in m- 

Appendix C 
Proof of Lemma [2] 

Proof: To compare the utilities obtained by the proposed CSRA algorithm and the exact CSRA solu- 
tion, we compare the Lagrangian values achieved by the two solutions. Recall [i* G [m, p] C [Mmin , Mmax] ■ 
Therefore, 

L(M%r( / u*),^(M*,r( / u*)))-L(^r^),a:*(M,r(M))) > 0, and 

L(M%r(^),^*(M%r(^)))-L(/i,I*(A),^(M,r(M))) > O. (54) 

The solution of the proposed CSRA algorithm allocates resources such that the sum-power constraint is 
satisfied while achieving a Lagrangian value of 

icsRA = \L(p, i*(p,),x*(fi,r(fi))) + (i-\)L( b r (m),**(m, **(/£)))• 

For any jjL, notice that L(fj,,I*(fx),x*(fj,,I*(fx))) = -U*(fi) + (X^(fj,) - P CO n)n, where U*(fi) is the 
total utility achieved due to optimal power allocation at that fi. Since the resource allocation obtained 
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by the proposed CSRA algorithm and the exact CSRA solution satisfy the sum-power constraint with 
equality, we have 

t^CSRA = -L(jj,*,r(ii*),x*(jjf,r(ji,*))), and (55) 
^CSRA = -#CSRA(>,A0 + (Xtatifi) ~ P con)^fl + (^"tot(^) ~ ^con)(l ~ A)/X 

= -^csra(m, A) + (-XtotO*) " P con)(fl ~ M)A. (56) 
Equation (j56]l holds since XX* ot (fi) + (1 - A)X t * t (/£) = P con - From ([55]) and ([56]>, we get 

< C^CSRA " ^CSRa(/£,M) = —L(ji*,I*(p*), X* (fi* , I* (jJ,*))) + LcSRA " (*tot(A) " ^con)(A " /f)A. 
From the above equation and d54l ), we have 

< ^CSRA " ^CSRA(/i, A) < (^con " X* ot (jl))(jl - fJ,)X < (fl - t )P cm . (57) 

■ 

Appendix D 
Sketch of proof of Lemma[3] 

Let fl G [jUmin , ^max] be any value of the Lagrangian dual variable for the CSRA problem. Then, at ft, 
one of the following three cases holds. 

1) \S n (fl)\ < 1 Vn. 

2) For some n, \S n (p,)\ > 1 but no two combinations in S n (fi) have the same allocated power. 

3) For some n, \S n (fi)\ > 1 and at least two combinations in S n (jl) have the same allocated power. 
We make use of two properties in the proof. Firstly, V n: k,m(lJ;Pn & m (/■*)) is a continuous function 

of \x. Therefore, by definition of continuous functions, if V n ^, m {fi'iPn k m(A)) > 0> tnen we can & x a 
^n,fc,m (> 0) such that V^ m (/i,p* fe m (/i)) > whenever |/x - /x| < 6n,fc,m- Secondly, for all values of 
u, we know dVn < k > m ^^. k <™^ = p* fcm (/z). We now apply these properties to each of the three cases 
to determine S n {fj) Vn. When u is sufficiently close to /2> we show that, in cases 1) and 2), one can 
fix a (5 such that < 1 Vn whenever < |u — jj,\ < 5. When this happens, it can be shown 

that, for all u l5 u 2 G (A - 6,p), one has I* (fix), I*(u 2 ) G {o, l} WxA ' xM and S n ( W ) = <S n (u 2 ) Vn. 
The same property holds when fj,x, u 2 G (A, A + 5). In case 3), we establish that all combinations with 
the same allocated power contribute equally to the total power allocated, as well as the total optimal 
value of Lagrangian. Therefore, all but any one combination can be ignored safely, implying that there 
exists a fixed 5 such that !"*(") G {0, \} NxKxM whenever — p\ < 5. After ignoring the redundant 
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combinations, it follows from cases 1) and 2) that, for all /ii,/i2 G (P ~ <^> A) an d W>/^2 G (A) A* + 
there exists I* (>i), I* (// 2 ) G {0, i}^x^ xM such ^ r ^ = jr*^ 

Appendix E 
Sketch of proof of Lemma [4] 

From (O and the stated assumptions, we have € ^dsra C Xqsra an d 

(l*(fi),x*(n,I*(n))\ = argmin ^ I n ,k,mFn,k,m(In,k,m,x n ,k,m) + ( ^ ~ P conV, (58) 

3; — " n,k,m n,k,m 
JgZdsra 

where F n fc m (v) was defined in (0]). Then, applying the concept of generalized Lagrange multiplier 
method from [31, Theorem 1], we conclude that 

(T,X*) = argmin ^ In,k,mF n ,k, m (K,k,m,^n,k,m) s.t. ^ X n)fc , m < ^ x* fcm (/x, I*(/i)).(59) 

{^^0} n,k,m n,k,m n,k,m 

IgZdsra 

Substituting X n fc jm = 1^ fc )fri Pn,A;,m back into the above equation, we obtain the desired result. 

Appendix F 
Proof of Lemma [5] 

Proof: Let us denote lim^-^ J7dsra(m> P) by ^dsra- The left inequality in the lemma is straight- 
forward since ^d SRA > ^dsra(^,^) V/x, fx. Now, if \S n (n*)\ < 1 Vn, then we have J7q SRA = ^csra = 
Cdsra* ensuring that the solution obtained via the proposed DSRA algorithm is optimal in the limit 
H,p, ->■ /i*. However, when |5 n (/x*)| > 1 for some n, -Peon lies in one of the "gaps" as mentioned in 
Fig. [3] and /qsra ^ x dsra- In this case, we have < U qsra ~ ^dsra < ^csra ~ ^dsra- Let 17* (J) be 
the optimal utility achieved for user-MCS allocation matrix I £ Xqsra- We recall from S ection ITlI-C I that, 
at //*, the allocation i" min (/i*) is one of possibly many values of i" minimizing L(li*,I,x*(/j,*,I)). Thus, 
U£ SRA = -L(jf, I min (n*), x*(jf, I min (fi*))). For brevity in this proof, let us denote I min (//*) and i" max (/i*) 
(G ^dsra), defined in 413, by I""" and I max , respectively. Therefore, £/ DSRA = max{£/*(I min ), [7*(I max )}. 
This gives us 

^CSRA ~ ^DSRA < ^CSRA _ U*(I mn ) 

= -L(if, r in ,x*(jS, r in ) ) + Ljmiu (fi* min , x* (p* min ) ) ) 

= -L(/x^/ m ^a:*(/i^/ mi^ )) + L(^ min ,/ m ^ ; r*(^ min ,/ mi^ )), m 

where, for (l60b . we use the equivalence between L(fj,,I,x) in (|5]) and Lj(^j,,x) in (I37T ). Note that 
A*jmin < since the total optimally allocated power for J min at /i = /x* is less than or equal to P con and 
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the total optimally allocated power for any given J is a decreasing function of fi. Plugging L(-, ■, •) from 
© into d60j), we get 



^CSRA - ^DSRA < - - A^-fcon + J] I%k,m( ~ U n,k,miPn,k,m(^*)) + P*PnJi,Jjt) 



(61) 



n,k,m 



+ 



M/min-Fcon + ^ ] In,k,m\ ^n,k,m(P n: k,m(^I min )) + A* (-^ )Pn,k,m(l J 'I n 



n.k.m 



where, f7 n ,A. jm (x) = E{J7 njfc)m ((l - a fcjm e bk - mX "'"- k )r ktm ) }. Using the definition of X t * ot (I,//) in |@2), 
we have X t * ot (I min , /i*) < P con and X t * ot (I min , /i* min ) = P con . Therefore, dgT) can be re-written as 



17, 



CSRA 



t^DSRA 



< (J,* (P C on — -^tot ( J" m ' n , //*)) — In"k,m ^n,fe,m(Pn,fe,m (M/min)) ~ U ny k,m{P n ,k,m{^*)) 



(62) 



n.k.m 



Calculating the first two derivatives of U n ^, m {x) with respect to cc, we find that it is a strictly-increasing 
concave function of x. Therefore, if xi < x 2 , one can write that U n ,k,m(x2) — Un,k,m(%i) — ( x 2 — 
x i)K,k,m( x 2)- Plugging x x =p* n km (fi*) and x 2 = P* nAm {^) into this inequality, we get 

^n,fc,m(Pn,fc,m(M/min)) - U n ^,m{Pn,k,m(P*)) - (Pn,fc,m(/4 min ) ~~ Pn.fc.mO^*)) 



(63) 

From (l62l) and (l63l) . we then get 

^CSRA ~ ^DSRA 

< (-Peon — X? oi (I mm , //)) — 7™fc )m ^,i^m(Ki,fc,ro(^^ 

n,k,m 

Evaluating J< )fc> J/^)), we find 



dU nt k,m(x) 



dx 



=K,*,,m(^min) 

^ /-^min- 



From (1641 ) and (I65I ). we finally obtain 

^CSRA ~ ^DSRA < (A 4 * - Mmin)(Pcon - X? oX (I mm , fi*)) < (//max - Mmin)Pcon- (66) 
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